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(54) An audio memo system and method of operation thereof 



(57) The hands-free audio memo system (10) and 
method uses an interface (12) to receive a voice input 
from a user, and a speech recognition unit (18) coupled 
to the interface (12) to monitor the voice input and rec- 
ognize a predetermined set of voice commands from the 
voice input. The speech recognition unit (18) generates 
a command signal that corresponds to the recognized 



voice command, which is received by a controller unit 
(20). The controller unit (20) activates a speech acqui- 
sition unit (16) coupled to the controller unit (20) to col- 
lect and stop collecting the voice input in response to a 
control signal generated by the controller unit (20). a 
memory (24) is provided to store the collected voice in- 
put. 
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Description 

TECHNICAL FIELD OF THE INVENTION 

This invention is related in general to the field of per- 
sonal electronic systems. More particularly, the inven- 
tion is related to an audio memo system and method of 
operation thereof. 

BACKGROUND OF THE INVENTION 

It is common knowledge that we are currently living 
in the Information Age. Data comes to us in visual, au- 
dio, and written forms through a myriad of channels: ra- 
dio, telecommunications, television, internet, world wide 
web, and just plain seeing, hearing, and feeling things 
as events occur around us. There are many instances 
when it is desirable to retain some of the information in 
a more reliable manner than the ability or inability to re- 
call data we are born with. For example, a telephone 
number announced on the radio, the location of a spe- 
cialty store, or an ingenious idea about a novel gadget 
to solve a stubborn problem. 

The old standby to record data is the pen and paper. 
However, there are times when it is inconvenient to 
write, such as when one is operating an automobile, or 
when pen and paper are not accessible. 

Dictaphones, which use audio tape cassettes, and 
some newer digital recorders, have been used to fill this 
void. However, they all require the use of at least one 
hand to hold the device, and to operate one of many 
buttons to turn "ON" the device, record, retrieve, erase, 
and turn "OFF" the device. Further, because it has been 
shown that the use of one hand to handle a wireless 
telephone while operating an automobile can lead to un- 
safe driving and possibly higher incidents of traffic acci- 
dents, it is less than desirable to also require the driver 
to devote the use of one hand to operate the recording 
device. 

SUMMARY OF THE INVENTION 

Accordingly, there is a need for an audio memo sys- 
tem which enables hands-free operation. 

In accordance with the present invention, a hands- 
free audio memo system and method of operation there- 
of are provided which eliminate or substantially reduce 
the disadvantages associated with prior devices. 

In one aspect of the invention, an audio memo sys- 
tem is provided that uses an interface for receiving a 
voice input from a user, and a speech recognition unit 
coupled to the interface for monitoring the voice input 
and recognizing a predetermined set of voice com- 
mands from the voice input. The speech recognition unit 
generates a command signal that corresponds to the 
recognized voice command, which is received by a con- 
troller unit. The controller unit activates a speech acqui- 
sition unit coupled to the controller unit for collecting and 



or stopping collection of the voice input in response to 
a control signal generated by the controller unit. A mem- 
ory is provided for storing the collected voice input. 
In another aspect of the invention, the personal 
5 memo system includes an analog interface for receiving 
a voice input from a user, a speech recognition unit cou- 
pled to the interface and adapted for recognizing a pre- 
determined set of voice commands from the voice input, 
and for generating a command signal in response there- 
to to. A controller unit is coupled to the speech recognition 
unit which generates a control signal in response to re- 
ceiving the command signal from the speech recogni- 
tion unit. A digital telephone answering device is cou- 
pled to the controller unit and analog interface for col- 
15 lecting and storing the voice input. 

In yet another aspect of the invention, a method for 
operating an audio memo system includes the steps of 
receiving a voice input from a user, recognizing voice 
commands in the voice input indicative of the user's de- 
20 sire to record an audio memo, collecting subsequent 
voice input, and storing the subsequent voice input. 

The audio memo system of the present invention 
provides a way for users to record audio memos and 
perform other functions without the use of a hand for its 
25 operation. This is especially advantageous for persons 
who are operating an automobile or performing other 
tasks that require concentration and generally the use 
of both hands. 



For a better understanding of the present invention, 
reference may be made to the accompanying drawings, 
in which: 



35 

FIGURE 1 is a simplified functional block diagram 
of an exemplary hands-free audio memo system 
constructed according to the teachings of the 
present invention; 

40 FIGURE 2 is a simplified block diagram of an alter- 
native embodiment of the hands-free audio memo 
system of the present invention; 
FIGURE 3 is an exemplary flowchart of a simplified 
hands-free audio memo algorithm according to the 

45 teachings of the present invention; 

FIGURE 4 is an exemplary flowchart of an hands- 
free audio memo algorithm according to the teach- 
ings of the present invention; 
FIGURE 5 is an exemplary flowchart of voice play- 

50 back and memo management functions of the 
hands-free audio memo algorithm according to the 
teachings of the present invention; and 
FIGURE 6 is an exemplary flowchart showing ex- 
emplary voice inputs to the system according to the 

55 teachings of the present invention. 
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the timer expires, speech acquisition is stopped, as 
shown in step 88. Controller unit 20 or 58 is then notified 
that memo recording terminated, as shown in step 90, 
and execution returns to step 74 to be ready for the next 
memo. 5 

A further, alternative method for hands-free audio 
memo 100 is shown in FIGURE 4. Speech acquisition 
16 or DTAD 56 and speech recognition 18 or 54 are ac- 
tivated either by starting the automobile, leaving the key 
in the accessory position, or the push of a button (not io 
shown), for example, as shown in step 1 02. Speech rec- 
ognition 1 8 or 54 monitors the speech uttered by the us- 
er(s) in the vicinity and searches for recognizable valid 
voice commands and/or responses, such as a com- 
mand to start the memo process, as shown in step 104. is 

When it is determined that the captured voice input 
is a valid command, such as "MEMO START," controller 
unit 20 or 58 is notified, as shown in steps 106 and 108. 
Controller unit 20 or 58 in turn activates the memo func- 
tion, as shown in step 1 1 0. In step 1 1 2, an audio prompt 20 
or speech (e.g., "MEMO SYSTEM READY") may be 
generated to signal to the user that he/she may begin 
to speak. The user's speech is then captured and com- 
pared with recognizable commands appropriate for the 
circumstances, such as "MEMO TERMINATE" to end 25 
the process, as shown in steps 114 and 116. Speech 
recognition 18 or 54 may be running in a low resource 
mode at this time to look for only those commands that 
are valid during this time, such as only the command to 
terminate or pause the memo taking process. If the cap- 30 
tured utterance is not a recognizable and valid com- 
mand, then it is collected, compressed, and stored, as 
shown in step 118. If in step 116, it is determined that 
the captured speech is a recognizable and valid com- 
mand to end the memo process, for example, then con- 35 
troller unit 20 or 58 is notified, as shown in block 120. 
Controller unit 20 or 58 then pauses speech acquisition, 
as shown in step 122, and instructs speech decompres- 
sion 26 and speech synthesis 27 to issue an audible 
prompt for confirmation, such as "READY TO TERMI- 40 
NATE MEMO?" The subsequent voice input is then cap- 
tured and monitored for a valid response to the prompt, 
such as "YES" or "NO," as shown in steps 126 and 128. 
If the received voice input is not a recognizable valid 
response to the confirmation, then an appropriate audio 45 
response may be generated to reconfirm, as shown in 
step 132. If the voice input is recognized as a response 
indicative that the user is not ready to terminate the 
memo process, then execution returns to step 112, to 
continue to record memo. If on the other hand the voice so 
input is recognized as an affirmative response in step 
1 30, then the memo function is stopped in step 1 34, and 
controller unit 20 or 58 is notified in step 1 36. Execution 
then returns to step 104 to prepare for the next memo. 

FIGURE 5 is a flowchart of memo playback and 55 
memo management functions of system 10 and 50. At 
step 76 shown in FIGURE 3 or step 106 shown in FIG- 
URE 4, if the voice input is not a valid start command, it 



is also checked for whether it is a valid playback com- 
mand, as shown in step 140. If it is, controller unit 58 is 
notified in step 1 42 and the user is prompted for addi- 
tional input, which is captured, as shown in step 144. 
The captured speech input is then examined to deter- 
mined whether it is a valid response to the prompt given 
in step 144, if not, some appropriate action is taken in 
step 148, such as issue an appropriate audio statement. 
If it is a valid response, then the memo playback function 
1 50 is launched, where the user may play back one or 
more previously recorded memos, skip one or more 
memos, etc. At the end of the memo playback function, 
the algorithm may return to step 74 in FIGURE 3 or step 
114 shown in FIGURE 4. 

If in step 140 it is determined that the speech input 
is not a valid playback command, then a determination 
is made as to whether it is a valid memo management 
command in step 152. If not, then the process may re- 
turn to step 74 in FIGURE 3 or step 114 shown in FIG- 
URE 4 to continue to capture the speech input. Other- 
wise, controller unit 58 is notified in step 154 and the 
user is prompted for additional input, which is captured, 
as shown in step 1 56. The captured speech input is then 
examined to determined whether it is a valid response 
to the prompt given in step 1 58, if not, some appropriate 
action is taken in step 1 48, such as issue an audio state- 
ment. If it is a valid response, then the memo manage- 
ment function 160 is launched, where the user may per- 
form operations such as delete, save, and protect on 
previously recorded memos. At the end of the memo 
management function, the algorithm may return to step 
74 in FIGURE 3 or step 114 shown in FIGURE 4. 

Referring to FIGURE 6, a more detailed process 
flow is shown. As voice input is captured in step 170, it 
is determined whether it matched any recognizable and 
valid command and response in step 172. For example, 
one or more recognized key phrases may be used to 
initiate system 50 in a memo recording mode 180, 
memo playback mode 182, memo management mode 
184, dialer mode 186, and voice mail mode 188, where 
each mode is shown with exemplary valid phrases rec- 
ognized when system 50 is in the respective modes. The 
key phrases to launch each mode may include "MEMO 
START" to launch the memo recording functions; 
"MEMO PLAYBACK" to launch the memo playback 
functions; "MEMO MANAGEMENT" to launch the 
memo management functions; "CALL X" to launch the 
dialer functions; and "GET MAIL" to launch the voice 
mail functions. Thus, speech recognition unit 54 need 
only to focus on a subset of possible valid utterances as 
to speed up search and processing time and to conserve 
resources. 

Although the present embodiment and its advan- 
tages have been described in detail, it should be under- 
stood that various changes, substitutions and altera- 
tions can be made therein without departing from the 
spirit and scope of the invention. 
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Claims 

1. An audio memo system, comprising: 

an interface for receiving a voice input from a 
user; 

a speech recognition unit coupled to the inter- 
face and adapted for recognizing a predeter- 
mined set of voice commands from the voice 
input, the speech recognition unit further being 
arranged for generating a command signal as- 
sociated with the recognized voice command; 
a controller unit coupled to the speech recog- 
nition unit for generating a control signal in re- 
sponse to receiving the command signal from 
the speech recognition unit; 
a speech acquisition unit coupled to the con- 
troller unit and the interface for collecting or ter- 
minating collection of the voice input in re- 
sponse to the control signal; and 
a memory unit coupled to the speech acquisi- 
tion unit for storing the collected voice input. 

2. The system as set forth in Claim 1 , further compris- 
ing; 

a speech compression unit coupled to the 
speech acquisition unit for compressing the collect- 
ed voice input prior to storing in the memory unit. 

3. The system, as set forth in Claim 1 or Claim 2, fur- 
ther comprising; 

a speech synthesis unit coupled to the inter- 
face and speech recognition unit for generating a 
predetermined set of audio confirmations in re- 
sponse to the voice input. 

4. The system as set forth in any of Claims 1 to 3, 
wherein the controller unit comprises a communi- 
cations link coupled to a personal computer for 
downloading stored voice inputs from the memory 
unit thereto. 

5. The system as set forth in any of Claims 1 to 4, 
wherein the controller unit comprises a communi- 
cations link coupled to a personal communication 
system for dialing a telephone number in response 
to a dialing command recognized by the speech 
recognition unit. 

6. A method for operating an audio memo system, 
comprising the steps of: 

receiving a voice input from a user; 
recognizing a voice command in the voice input 
indicative of the user's desire to record an audio 
memo; 

collecting subsequent voice inputs; and 
storing the subsequent voice inputs. 



7. The method as set forth in Claim 6 further compris- 
ing the step of; 

generating an audio confirmation in response 
to recognizing the voice command. 

5 

8. The method as set forth in Claim 7, wherein the au- 
dio confirmation generating step further comprises 
the step of synthesizing a speech signal. 

10 9. The method, as set forth in any of Claims 6 to 8, 
further comprising the steps of: 

recognizing a voice command in the subse- 
quent voice inputs that is indicative of the user's 
15 desire to stop recording the audio memo; and 

stopping collection of subsequent voice inputs. 

10. The method, as set forth in Claim 9, further com- 
prising the step of: 

20 

prior to said step of stopping collection of sub- 
sequent voice inputs generating an audio con- 
firmation in response to recognizing the voice 
command; and 

recognizing a confirmation in the voice input. 

11. The method, as set forth in any of Claims 6 to 8, 
further comprising the steps of: 

recognizing a voice command in the subse- 
quent voice inputs that is indicative of the user's 
desire to stop recording the audio memo; 
generating an audio confirmation in response 
to recognizing the voice command; 
recognizing a denial in the voice input; and 
continuing collection of subsequent voice in- 
puts. 

12. The method as set forth in any of Claims 6 to 11, 
further comprising the steps of: 

starting a timer for a predetermined time period 
after the voice command recognizing step; and 
stopping collection of subsequent voice inputs 
when the timer expires. 

13. The method as set forth in any of Claims 6 to 8, fur- 
ther comprising the steps of: 

converting the voice input to a digital voice in- 
put; and 

compressing the digital voice input prior to stor- 
ing thereof. 

14. The method, as set forth in any of Claims 6 to 8, 
further comprising the steps of: 

recognizing a voice command in the voice input 
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indicative of the user's desire to playback 
stored voice input; and 

playing back stored voice input in response to 
further voice input. 

5 

15. The method as set forth in any of Claims 6 to 8, fur- 
ther comprising: 

recognizing a voice command in the voice input 
indicative of the user's desire to manage stored io 
voice input; and 

performing management functions in response 
to further voice inputs. 

16. The method, as set forth in any of Claims 6 to 8, 15 
further comprising the steps of: 

recognizing a voice command in the voice input 
indicative of a specific function; and 
focusing on a subset of subsequent valid voice 20 
commands in response thereto. 

17. A personal memo system comprising: 

an analog interface for receiving a voice input 2s 
from a user; 

a speech recognition unit coupled to the inter- 
face and adapted for recognizing a predeter- 
mined set of voice commands from the voice 
input, the speech recognition unit further being 30 
arranged for generating a command signal as- 
sociated with the recognized voice command; 
a controller unit coupled to the speech recog- 
nition unit for generating a control signal in re- 
sponse to receiving the command signal from 35 
the speech recognition unit; and 
a digital answering device coupled to the con- 
troller unit and analog interface for collecting 
and storing the voice input. 

40 

18. The system as set forth in Claim 17, wherein the 
controller unit comprises a communications link 
coupled to a personal computer for downloading 
stored voice inputs from the memory unit thereto. 

45 

19. The system as set forth in Claim 17, wherein the 
controller unit comprises a communications link 
coupled to a personal communication system for di- 
aling a telephone number in response to a dialing 
command recognized by the speech recognition so 
unit. 
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